On the Use of Spatiotemporal Visual Attention for Video Classification
نویسندگان
چکیده
It is common sense among experts that visual attention plays an important role in perception, being necessary for obtaining salient information about the surroundings. It may be the “glue” that binds simple visual features into an object [1]. Having proposed a spatiotemporal model for visual attention in the past, we elaborate on this work and use it for video classification. Our claim is that simple visual features bound to spatiotemporal salient regions will better represent the video content. Hence, we expect that feature vectors extracted from these regions will enhance the performance of the classifier. We present statistics on sports sequences of five different categories that verify our claims.
منابع مشابه
An Enhanced Spatiotemporal Visual Attention Model for Sports Video Analysis
Inspired by the human visual system, visual attention (VA) models seem to provide solutions to problems of semantic image understanding by selecting only a small but representative fraction of visual input to process. Having proposed a spatiotemporal VA model for video processing in the past, we propose considerable enhancements in this paper, including the use of steerable filters for 3D orien...
متن کاملRecognition of Visual Events using Spatio-Temporal Information of the Video Signal
Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...
متن کاملA spatiotemporal model with visual attention for video classification
High level understanding of sequential visual input is important for safe and stable autonomy, especially in localization and object detection. While traditional object classification and tracking approaches are specifically designed to handle variations in rotation and scale, current state-of-the-art approaches based on deep learning achieve better performance. This paper focuses on developing...
متن کاملSpatiotemporal saliency for video classification
Computer vision applications often need to process only a representative part of the visual input rather than the whole image/sequence. Considerable research has been carried out into salient region detection methods based either on models emulating human visual attention (VA) mechanisms or on computational approximations. Most of the proposed methods are bottom-up and their major goal is to fi...
متن کاملImmediate Effects of Different Screen Sizes on Visual Fatigue in Video Display Terminal Users
Background: Computer usage has rapidly grown. This is because it helps to resolve problems, i.e., encountered in daily life by individuals. Various monitor screens that have been developed affect the userchr('39')s eyes. Screen size is one of the relevant impacts. Thus, this study compared the immediate effects of two computer screen sizes on visual fatigue in Video Display Terminal (VDT) users...
متن کامل